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For nearly three decfides researchers have atten^ted to establish 
which acoustic cues are used by language users to discriminate between 
speech sound 3. One such acoustic cue has been identified by Lisker and 
Abramson (1964) as voice onset time (VOT)-'the ti.me between the release 
v' rrt and the onset of Laryngeal pulsing. VOT has been shown to be a 
sufficient cue for the discrimination between a number of American English 
voiced and voiceless stop consonants such as /b/ and /p/» /d/ and /t/, and 
frj and M, 

Libertnan, Cooper, Shankweiler and Studdart-Kennedy (1967) have 
demonotrated that although VOT is a continuum, adult perception along that 
continuum appears to be categorical. That is» adults can readily discrim- 
inate two sounds that lie across the phoneme boundxy of '*'30 msec VOT (e.g., 
/ba/ ^'ersuQ /pa/) but perform only at chance le«'el when asked to discrim- 
inate letween sounds which represent acoustic variations within the same 
phor.er..? catefory (e.g., /ba/ with a 0 msec VOT versus /ba/ with 20 msec 
VOT). Since adui-. Jisteners cannot hear xntraphonemic differences, all 
variations within the phoneme category are assigned the same label by adu)^£ 

Consequently, adult perception for speech sounds appears to be quali- 
tatively different from their perception of non-speech sounds such as 
noises or musical Lon^^s. More recently, several researchers have suggested 
that thi.s ability *.o categorically perceive speech sounds is already pre- 
sent in the early months of life. 

Eimas, Siquelanti, Jusczyic and Vigor ito (1971) have reported finding 
categorical perception m one and four-month-old infants in a study which 
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employed non-nutritive sucking as a response measure. Infants, like adults, 
wtre able to discriminate between the voiced and voiceless stop consonants 
/b/ and /p/ but were unable to discriminate between sounds Wi-thin a phoneme 
category. Similar findings have been reported by Morse (1971), Trehub 
and Rabinovitch (1972), and Treub (1973). 

In addition to these beliavioral data, Molfese (1972), using evoked 
potential recording procedures reported that infants, children and adults all 
responded to speech stimuli with larger amplitude electro cortical activity 
located in different areas of the brain than to non-speech auditory stimuli. 
The brain appears to be specialized to differentiate speech materiaU ftca 

other sounds early in life. 

Thus, some perceptual language skills appear to be available to the 
mfanl from birth. Furthermore, these skilU are con^rable, both behav- 
icrally and neurologies Lly to those of adult language users. 

This position has been questioned by Butterfield and Catrns (1974). 
They note that infant categorical perception of phonemes ^s asymmetrical. 
Infant discrimination across the phoneme bcundry is beticr if the VOT 
values increase tiom one stimulus to the next (e.g., 20 to 40 msec) than 
if they decrease (e.g., 40 to 20 msec). It mfant perception is asymmetrical 
while adult perception is not, this behavioral difference may indicate that 
infent and adult speech perception 8i^i.lls are not as comparable as is 

currently thought. 

A siudy was conducted by the present authcrs to measure the symoetry 
of adult categorical ph neme perception. 
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Method 

Subjects: 

Ten adult nale undergraduate students enrolled in an Introductory 
Psychology class participated m this study as part of their course 
requirements. 
Stimuli: 

The stimuli employed in the present study were prepared at Haskins 
Laboratories by means of a parallel rt^sonance synthesiser and computer. 
The biisbial stop consonants generated had voice onse; t imes of 0, 20, 40, 
60, and 80 milliseconds (the interval between release ^ rst and laryngeal 
pulsing or voicing). Each consonant was followed by i..e vowel /a/. 
All stimuli were 300 msec in duration. 

Thirty-nine randomly ordered sets of ten stimuli were constructed. 
Within each set the stimuli were presented at a rate of one per second. 
A seven second gap separated each set. 

Fifteen of the sets contained a series of ten identical stimuli 
while the remaining 24 sets contained VOT changes of 20 msec in either 
the fourth, sixth or eighth stimulus. For example, the first five 
Stimulus sounds in one series of ten stimuli might be bilabial stop 
consonants with 20 msec VOTs while the second five sounds of the same 
series would be bilabial stop consonants with 40 msec VOTs. Twelve of 
these "VOT change" sets were characterized by an increase in VOT (e.g., 
0 to 20, 20 to 40, etc.) and 12 contained a decrease (e.g., 20 to 0, 
40 to 20, etc. ). 
Procedure : 

Each S was tested individually in a sound dampened room. Subjects 

3 
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were seated at a lable feeing two speakers: one Located 90 inches fnm the 
subject, the other Located 6 inches direct Ly behind the first. A response 
key vas positioned direct Ly in front of the subject ♦ 

The speech stimuLi were deLiverod ihruugh the farthest speaker at 
83 db (SPL) as measured at the subject ear. White noise was presented 
continousLy at 65 db (SPL) through the front speaker to mask background 
ambient noise* The subjects were instructed to Listen to each set and to 
press a Icey lu indica**e the position (>f a perceived change* The subjects 
were toLd that some sets wouLd contain obvious, subtLe or no changes. 

ResuLts 

To insure the quality uf the stimuLl synthesis, L5 additional adult 
subjects were administered the LabeLing and ABX discrimination tasks of 
Libertnan et aL (L967). Figure one presents the LabeLing data that were 
thus obtained. 

Insert Figure One 

As can be seen from this figure, the stimuli were identified as /ba/ 
when the VOT vaLues were? 0 or 20 msec. Wh^n VOT vaLues were 40, 60 or 80 
msec, the stimuli were identified as /pa/. This finding replicates those 
reported by Liberman et al (L967). 

Figure two presents the discrimination data obtained using an 
ABX paradigm. 

Insert Fi-^ure Two 
This figure indicates the percentage of correct responses illicited 
by various stimuli changes. The «bscissa values present the VOT values 
for the different stimuli. 
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The abscii sa values are the starting points of ascending VOX changes 
(e.g., 0 is the starting point of a 0 to 20 VOT change). Each abscissa 
value is also the terminal point of a descending VOT change (e.g., 0 is the 
terminal point of a 20 to 0 VOT change). Thus, each abscissa has two 
ordinate values, one for ascending and one for descending VOT clianges. 

The ABX date clearly indiijate categorical perception. Correct 
responding dra^ticalLy increased when VOT changes transgressed the +30 
msec boundry. Perception within phoneme categories was at chance level. 
In addition no asynunet^y was found. Perception was equal for ascending and 
descending VOT changes.- \ 

In general these findings replicate those reported earlier by 
Liberraan et al (1967). 

The data generated by the ten-item 'change detection" paradigm 
described earlier were analyzed using a repeated measures ANOVA. Alpha 
was set at .01 for all statistical analyses. 

The analysis indicated significantly better discrimination for 
changes across the phoneme boundry than within adult phoneme phoneme 
categories (F= 21.036). There was also a significant interaction between 
place of VOT change (across or within boundry) and direction of change 
(asending or descending) (F= 5.375). 

Figure three presents the discrimination function obtained from 
the change detection .paradigm. 

Insert Figure Three 

This figure is read in the same way as was figure two. This figure 
indicates that the significant effect of place of VOT change derives fifoo 

r 
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the fact that perception is significantly better across the phoneme 
boundry than \nthin categories. 

The significant place x direction interaction indicates an asymnetiy. 
Scheffe^s test was computrl tu test for significant differences between the 
8 points on figure three. This ana. s indicated that perception of VOT 
changes that descended across the boundry (from 40 to 20) was significantly 
better tlian any other type of VOT cliange including changes ascending across 
the boundr/ (from 20 to 40 msec). The only < ther significant differences 
were between ascending acr-)ss boundry changes (from 20 to 40 msec) and 
descending within category changes (from 40 to 60 and from 60 to 80 n&ec). 
The responding to the ascending VOT change from 40 to 60 msec was at 10% 
which is precisely at chance level. It is noted that the ascending across 
boundry VOT change (from 20 to 40 msec) approached but did not reach a 
Level of statistically significant difference when compared to this 
chance level. 

Discussion 

A task was employed to assess categorical perception different fron 
the standard ABX paradigm Chance responding in the standard ABX task is 
at the 50% level as compared to only 10% in the present task. Using this 
task, we were able to replicate the adult categorical discrimination effect. 
However,' the data indicate that the effect is asymmetrical. Perception 
is better if a decrease is made in the VOT across the boundry than if it 
increases. 

Butterfield and Cairns (1974) report an asymmetry of categorical 

perception for infants in the opposite direction to that reported here 

L 
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with adults. The data of those researchers indicate superior perc-otion 
for ascending VOT changes from -20 msec to +80 msec. The lack of agi-eement 
between the present study ana that of ButterfieLd and Cairns may stem from 
several sources • First, the present study employed 20 msec VOT changes 
as opposed to a LOO msec change. Whethor different steps in the VOT change 
will affect the asymmetry is an emperical question that deserves further 
study. A second difference is that the ButterfieLd and Cairns study 
measures discriminations be^^ween pre voicing and delayed voicing. It is 
not clear to the present authors that this procedure is necessarily 
comparable to comparing different voicing lags. The former discrimination 
may assess different abilities. A third possible source of the difference 
between results may be the use of different response measures. Infant 
discriminations were inferred from changes in high amplitude sucking. The 
adult subjects in the present study were required to respond as if operating 
in a signal detection situation. Consequently, the grossly different 
response measures may accouiv for the discrepancy in the data. A final 
point is that the data my reflect actual developmental differences in the 
perception of speech cues. 

In any event, the exislenc« of asymmetry in categorical perception 
by adults and children may indicate that the skills employed by each are at 
some level cornparable t' each jtUer. The data tend lo support a position 
for an innate basis for language. 

A crucial qualification needs to e/.preised. The phenomenon of 
categorical perception may be dependent on acoustic and physiological variables 
as well as )n linguistic significance. This is indicated by the asymmetry 

1 
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which has been found. The dist inctiun between two phonemes, if solely 
determined by the linguistic significance of each^ would be equal re- 
^rdless of which phoneme is presented first Thus, hearing /ba/ or 
/pa/ first should uot sigiiif icantly alter the discrimination between them. 
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Figure 1 

Identification of Synthetic Spe&ch Stimuli 
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Discrimir^aticn cf Stimili usin^: an AB> paradigm 
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2iscrir5ination of S':.ir:uli usin^ a lO-item scries paradigm 
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